Visualization for Coreference Annotation
نویسندگان
چکیده
The annotation of documents with linguistic information requires time-consuming and therefore expensive manual annotation. Especially, a complex task, like coreference resolution, needs large data sets for the training of supervised machine learning methods. We present a tool which combines visualization techniques and unsupervised machine learning to support the annotation of documents with coreference information. Self-organizing Maps are used to cluster similar data and visualize the feature space. For link visualization, precise annotation, and error correction a matrix-based coreference visualization is used which exploits the transitive property of the coreference relation.
منابع مشابه
A Coreference Corpus and Resolution System for Dutch
We present the main outcomes of the COREA project: a corpus annotated with coreferential relations and a coreference resolution system for Dutch. We discuss the annotation of the corpus: the type of annotated relations, the guidelines, the annotation tool and interannotator agreement. We also show a visualization of the annotated relations. The standard approach to evaluate a coreference resolu...
متن کاملCoreference Annotation Scheme and Relation Types for Hindi
This paper describes a coreference annotation scheme, coreference annotation specific issues and their solutions through our proposed annotation scheme for Hindi. We introduce different co-reference relation types between continuous mentions of the same coreference chain such as ‘Part-of’, ‘Function-value pair’ etc. We used Jaccard similarity based Krippendorff‘s’ alpha to demonstrate consisten...
متن کاملA Pilot Study on Computer-aided Coreference Annotation
We present the results of a pilot study on increasing the efficiency of coreference annotation by integrating the predictions of existing coreference components. While similar approaches are already quite common for other linguistic annotation tasks, our experiments are the first to address a more complex task such as coreference annotation.
متن کاملANALEC: a New Tool for the Dynamic Annotation of Textual Data
We introduce ANALEC, a tool which aim is to bring together corpus annotation, visualization and query management. Our main idea is to provide a unified and dynamic way of annotating textual data. ANALEC allows researchers to dynamically build their own annotation scheme and use the possibilities of scheme revision, data querying and graphical visualization during the annotation process. Each qu...
متن کاملWhat Is Coreference, And What Should Coreference Annotation Be?
In this paper, it is argued that 'coreference an-notation', as currently performed in the MUC community, goes well beyond annotation of the relation of coreference as it is commonly understood. As a result, it is not always clear what semantic relation these annotations are actually encoding. The paper discusses a number of interrelated problems with coreference annotation and concludes that re...
متن کامل